Justsystem at NTCIR-5 Patent Classification
نویسندگان
چکیده
Justsystem participated in Patent Classification Subtask at the Fifth NTCIR workshop. This paper overviews our machine learning-based patent application classification system. Straightforward application of Naive Bayes classifier was effective in theme categorization subtask that has a non-hierarchical category structure. In F-term categorization subtask, we regarded the complicated F-term categorization system as a tree with depth 2. We constructed the document classifier based on the Support Vector Machine and classify documents on this tree. Platt’s sigmoid fitting for SVM output was used for the document ranking. We confirmed that this method was effective for this subtask.
منابع مشابه
Notes on the Limits of CLIR Effectiveness: NTCIR-2 Evaluation Experiments at Justsystem
NTCIR-2 evaluation experiments at the Justsystem site are described with a focus on comparative study of CLIR effectiveness with monolingual retrieval effectiveness of the same retrieval engine. Experiments on the effects of phrasal translation, indexing of translated phrasal terms, pre-translation feedback and parallel documents feedback in diverse retrieval settings, are reported. The results...
متن کاملOverview of Classification Subtask at NTCIR-5 Patent Retrieval Task
This paper describes Classification Subtask at NTCIR-5 Patent Retrieval Task. We perform two subtasks for patent classification using a multi-dimensional classification structure called “F-term (File Forming Term) classification system”. The first one is Theme Categorization Subtask, where each participant classifies a patent into technological fields called themes. The second one is F-term Cat...
متن کاملOverview of Classification Subtask at NTCIR-6 Patent Retrieval Task
This paper describes the Classification Subtask of the NTCIR-5 Patent Retrieval Task. The purpose of this subtask is to evaluate the methods of classifying patents into multi-dimensional classification structures called F-term (File Forming Term) classification systems. We report on how this subtask was designed, the test collection released, and the results of the evaluation.
متن کاملOverview of Patent Retrieval Task at NTCIR-5
In the Fifth NTCIR Workshop, we organized the Patent Retrieval Task and performed three subtasks; Document Retrieval, Passage Retrieval, and Classification. This paper describes the Document Retrieval Subtask and Passage Retrieval Subtask, both of which were intended for patent-to-patent invalidity search task. We show the evaluation results of the groups participating in those subtasks.
متن کاملJustsystem-Clairvoyance CLIR Experiments at NTCIR-4 Workshop
At the NTCIR-4 workshop, Justsystem Corporation and Clairvoyance Corporation collaborated in participating in the Cross-Language Retrieval Task (CLIR). We submitted results to the sub-tracks of SLIR and BLIR. For the SLIR track, we submitted Chinese, English, and Japanese monolingual runs. For the BLIR track, we submitted Japanese-English and Chinese-English runs. The major goal of our particip...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005